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Abstract. We consider approximating the minmax value of a multi- 
player game in strategic form. Tightening recent bounds by Borgs et 
al, we observe that approximating the value with a precision of elogn 
digits (for any constant e > 0) is NP-hard, where n is the size of the 
game. On the other hand, approximating the value with a precision of 
clog log n digits (for any constant c > 1) can be done in quasi-polynomial 
time. We consider the parameterized complexity of the problem, with the 
parameter being the number of pure strategies k of the player for which 
the minmax value is computed. We show that if there are three players, 
k = 2 and there are only two possible rational payoffs, the minmax 
value is a rational number and can be computed exactly in linear time. 
In the general case, we show that the value can be approximated with 
any polynomial number of digits of accuracy in time n°^. On the other 
hand, we show that minmax value approximation is lT[l]-hard and hence 
not likely to be fixed parameter tractable. Concretely, we show that if 
k-CLIQUE requires time n nfk > then so does minmax value computation. 

1 Introduction 

A game G in strategic form between I players is given by a set of players 
{1, ■■•,1} and for each player j a finite strategy space Sj and a utility func- 
tion Uj : Si x S2 x • • ■ x Si — > R. In this paper, only the utility function for 
Player 1 is relevant. When the size of Sj is nj, we shall refer to the game as 
an m x ri2 x • • • x ni game. The minmax (or threat) value of G for Player 1 is 
given by min^ .g/jci-i) maxagSj E[ui(a, U-i)] where A^ 1 ^ is the set of mixed, 
but uncorrelated, strategy profiles for players 2,...,L A profile <r_i achiev- 
ing the minimum in the expression is called an optimal minmax profile or an 
optimal threat. The maxmin (or security) value of G for Player 1 is given by 
max ( j 1 ^/\ uim a2 _ a; E[ui(ai, a%, . . . , a ; )] where A is the set of mixed strategies 
for Player 1. 

The minmax value of a finite two-player game is a fundamental notion of 
game theory. Its mathematical and computational properties are extremely well- 
studied and well-understood, being intimately tied to the theory of linear pro- 
gramming. In particular, the duality theorem of linear programs implies that the 
minmax value equals the maxmin value. Also, the computation of the minmax 
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value of a two-player game in strategic form is essentially equivalent to solv- 
ing linear programs and can therefore be done in polynomial time (although a 
strongly polynomial time algorithm remains an open problem). 

Minmax values of multi-player games are much less well-studied, although 
these values are arguably also of fundamental interest to game theory. Most 
importantly the minmax value pays a pivotal role in the statement and proof 
of the so-called folk theorem that characterize the Nash equilibria of infinitely 
repeated games. Additionally, the minmax value is the equilibrium payoff of the 
so-called team-maxmin equilibria studied by von Stengel and Koller [15]. For a 
multi-player game, the maxmin value may be strictly smaller than the minmax 
value. Computation of the maxmin value easily reduces to the two-player case 
and can therefore be done efficiently using linear programming. Rather surpris- 
ingly computation of the minmax value of a multi-player game in strategic form 
was not studied until very recently, where Borgs et al. [1] (motivated by com- 
putational aspects of the folk theorem) showed that approximating the minmax 
value of a three-player game within a certain inverse polynomial additive error 
is NP-hard. Our starting point is this important paper. 

Given the fundamental nature of the notion of the minmax value, it is im- 
portant to understand when the NP-hardness result can be circumvented by 
considering special cases or asking for weaker approximations. The purpose of 
this paper is to provide a number of results along these lines. First, we observe 
that the inapproximability result of Borgs et al. can be tightened and matched 
with a positive result, using standard techniques: 

Theorem 1. For any constant e > 0, approximating the minmax value of an 
n x n x n game with 0-1 payoffs within additive error l/n e is NP-hard. On 
the other hand there is an algorithm that, given a parameter e > and a game 
in strategic form with I players each having n strategies and all payoffs being 
between and 1, approximates the minmax value for Player 1 from above with 
additive error at most e in time n ^ losn ^ e \ 

This suggests the following important problem: Can the minmax value of a 
three-player game with payoffs normalized to [0, 1] be approximated within a fixed 
constant (say 0.01 or even 0.99) in polynomial (rather than quasi-polynomial) 
time? We leave this problem open. 

It is of interest to know when the minmax value can be computed exactly. A 
prerequisite for this is that it is rational. For three-player games, we characterize 
when the minmax value for Player 1 can be an irrational number, in terms of the 
number of strategies of Player 1 and the number of distinct (rational) payoffs. 
For the special case where the value is guaranteed to be rational we present an 
optimal linear time algorithm for exactly computing the minmax value 1 . 

1 As the algorithms of Theorem 1 and Theorem 2 are very simple, we express their 
complexity in the unit cost random access machine model. E.g., by "linear time" we 
mean a linear number of atomic operations in the number of real payoffs of the input. 
On the other hand, the algorithm of Theorem 3 use sophisticated algorithms from 
the literature as subroutines and its complexity is better expressed in the Turing 
machine model, and in terms of bit complexity. 



Theorem 2. Consider kxnxn three-player games with only I distinct rational 
payoffs. When either k > 2 and I > 3 or k > 3 and I > 2 there exists a game 
such that the minmax value for Player 1 is irrational. Otherwise, when k — 2 and 
I = 2 the minmax value for Player 1 is a rational number and we can compute 
it exactly in time 0(n 2 ) (on a unit cost random access machine model). 

Thus having observed that the case of few strategies of Player 1 may be easier 
than the general case, we apply the approach of parameterized complexity [6], 
considering the number of strategies k of Player 1 as the parameter. Combining 
a classical result of Shapley and Snow [14] with Renegar's decision procedure for 
the first order theory of the reals [11,12,13] gives rise to a support enumeration 
algorithm for finding the minmax value and we show 

Theorem 3. Given a k x n x • • • x n l-player game G with rational payoffs 
and a rational number a so that (G, a) has combined bit complexity L, we can 
decide in time L ^k°^ kl ^n kl (on a Turing machine) if the minmax value of G 
for Player 1 is at most a. Using the terminology of fixed parameter complexity 
theory, considering k the parameter, a modification of the algorithm shows that 
this problem is in W{P], and for the case of 0-1 payoffs in W[l\. 

In particular, if I and k are constants, the complexity is polynomial, and we can 
approximate the minmax value with any polynomial number of bits of accuracy 
in polynomial time by using the decision procedure in a binary search. As the 
exponent in the above complexity bound depends linearly on k with impractical 
bounds for large k as consequence, we next ask if the problem of approximating 
the minmax value for Player 1 in a three-player game is fixed parameter tractable, 
i.e., if an algorithm solving the problem in time f(k)n c exists, where / is any 
function and c is a constant not depending on k. We provide a reduction from 
fc-CuQUE that gives negative evidence. 

Theorem 4. Let G = (V,E) be an undirected graph with \V\ = n, then k- 
Clique reduces to the problem of approximating the threat value for Player 1 
within l/(4fc 2 ) in a three-player 2k x kn x kn game with payoffs and 1. 

Downey and Fellows [8] proved that the Clique problem is complete for the 
class W[l], and hence it immediately follows that the problem of approximating 
the minmax value within l/(4fc 2 ) for Player 1 in a 2k x kn x kn game with k 
being a parameter is hard for W[l], even when all payoffs are or 1. Combining 
this with Theorem 3, we in fact have that the 0-1 case is VF[l]-complete. Readers 
not well- versed in the theory of parameterized complexity may find the following 
consequence of the reduction more appealing: The minmax value of a 2k x kn x kn 
three-player game with 0-1 payoffs cannot be approximated in time n°( k \ unless 
/c-Clique can be solved in time n o(k K If /c-Clique could be solved in time n o(k ^ 
then as proved by Chen et al. [3] it would follow that all problems in the class 
SNP (e.g. 3-SAT) could be solved in time 2°( n \ Thus, under the assumption 
that all of SNP cannot be solved in time 2°( n \ the algorithm of Theorem 3 is 
essentially optimal for the case of 0-1 payoffs, in the sense that its complexity is 
n °( k ) anc J n °( k ) j s a lower bound. 



2 Proofs 



2.1 Proof of Theorem 1 

Wc first prove the hardness claim. Borgs et al. showed hardness of approximation 
with additive error 3/n 2 . Now consider, for a positive integer c > 2 the following 
"padding" construction: Given an n x n x n game G with strategy space Si 
of Player i and utility function u\ for Player 1. Let n' = n c and define the 
n' x n' x n' game G' with strategy space S[ = Si x {1, . . . , n c_1 } and utility 
function for Player 1 being u' x {{x, a\), (y, 02), (z, (X3)) = ui(x,y,z). In words, 
G' is simply G with each strategy copied n c_1 times. Now, G' and G clearly 
have the same minmax value. Also, for a given e > 0, by picking c to be a 
large enough constant, we can ensure that l/{n') e < 3/n 2 , so approximating the 
minmax value of G within 3/n 2 reduces to approximating the minmax value of 
G' within l/(n') e , which concludes the proof of hardness (we remark that this 
simple padding argument also yields a somewhat simpler proof of Lemma 7.1 of 
Chen, Teng and Valiant [4]). 

We now proceed with the positive approximation result. We only show the 
result for the case of three players; the general case being very similar. For the 
proof, we will use the following Theorem by Lipton and Young [10, Theorem 2]: 

Theorem 5. For a two-player zero-sum n x n game with payoffs in [0, 1], there 
is a simple strategy for each player that guarantees that player a payoff within e 
of the value of the game. Here, a simple strategy is one that mixes uniformly on 
a multiset of |~lnn/(2e 2 )] strategies. 

Now consider a given 3-player game G and consider the optimal threat strategy 
profile (0-2,(73) of Players 2 and 3 against Player 1. Consider 03 as fixed and look 
at the resulting two-player game G' between Player 1 (maximizer) and Player 2 
(minimizcr). Clearly, this game has value equal to the minmax value for Player 
1 in G. Applying Theorem 5, there is a simple strategy a 2 for Player 2 that 
guarantees this value within e. Fix a' 2 to this strategy and look at the resulting 
two-player game G" between Player 1 and Player 3. By construction of a' 2 , this 
game has value at most e larger than the value of G'. Applying Theorem 5 again, 
there is a simple strategy a 2 for Player 2 that guarantees this value within e. 
Thus, if Player 2 and Player 3 play the profile {o~ 2 ,a' 3 ), in the original game, 
they are guaranteed the minmax value of G plus at most 2e. 

Now, given some e', we let e = e'/2 and approximate the threat value of 
Player 1 within e' by exhaustively searching through all pairs of simple strate- 
gies for Player 2 and Player 3, compute the maximum payoff for Player 1, and 
return the lowest such payoff. This completes the proof of the theorem. 

It is natural to ask if one can get any non-trivial approximation by consid- 
ering strategies that mix uniformly over only a constant size multiset, as this 
would lead to a polynomial time approximation algorithm rather than a quasi- 
polynomial one. Unfortunately, the answer is negative: For given n and c, let m 
be maximal such that (™) < n. Consider the (™) x to x m game G defined as 



follows. For every two subsets of actions of size c for Player 2 and Player 3 there 
is an action for Player 1 such that he receives payoff 1 if the other players chose 
an action from the two subsets. Otherwise Player 1 receives payoff 0. If Player 
2 and Player 3 choose their actions uniformly at random, Player 1 can ensure 
payoff at most (^) 2 , while on the other hand if the strategies of Player 2 and 
Player 3 has support size at most c, then Player 1 can ensure payoff 1. We can 
now, in a similar way as in the proof of Theorem 1 above, construct a padded 
version of the game, obtaining a n x n x n game G' such that the minmax value 
for Player 1 is at most ( — ) 2 < ^ 2e j , but for every strategy profile for Player 2 

n c 

and Player 3 of support size at most c, Player 1 can ensure payoff 1. Thus to 
approximate the minmax value within, say, \ we must have c > jlg"^ • 



2.2 Proof of Theorem 2 



First, we give the claimed examples of games for which the minmax value for 
Player 1 is irrational. We describe the games by a matrix for each action of 
Player 1, where rows and columns correspond to the actions of Player 2 and 
Player 3, respectively. That is, we let ui(i,j, k) — Ai(j, k). 

The first game is a 2 x 2 x 2 game where there are 3 distinct payoffs, given 
by the following matrices. 



A l = 



1 





Ao 



00 
2 



"10" 




'0 0" 




'0 1" 





A 2 = 


1 1 


A 3 = 


1 



It is easy to see that the strategy for Player 2 and Player 3 yielding the minmax 
value for Player 1 is to play the first action with probability 2 — \/2- This results 
in a minmax value of 6 — 4-^/2 for Player 1 . 

The second game is a 3 x 2 x 2 game where there are 2 distinct payoffs, given 
by the following matrices. 



It is easy to sec that the strategy for Player 2 and Player 3 yielding the minmax 
value for Player 1 is to play the first action with probability v/ ^~ 1 (golden ratio 
conjugate). This results in a minmax value of for Player 1. 

We now examine the special case where there are only two distinct payoffs, 
and Player 1 only has two possible actions. For this case, we show that the threat 
point is always rational valued, and the threat strategies can be computed in 
linear time. We assume without loss of generality that the two possible payoffs 
are and 1. The proof is a case analysis, where each case can be identified and 
solved in linear time, assuming that none of the previous cases apply. Case 1 
and 2 are the trivial cases where either side has a pure optimal strategy. 
Case 1: 3i\/j,k : u±(i,j,k) — 1. Player 1 has a "safe" action, i, such that 
no matter what Players 2 and 3 does, Player 1 achieves value 1. Any strategy 
profile for Players 2 and 3 is an optimal threat, with minmax value 1. 



Case 2: 3j, fcVi : Ux(i,j, k) — 0. Players 2 and 3 have a pure strategy threat, 
such that no matter what Player 1 does, the payoff is 0. The strategy profile 
(j, k) is an optimal threat, with minmax value 0. 

Case 1 and Case 2 can easily be identified and solved in linear time. Notice that 
when we are not in case 2, we have that Vj, k : 3i : u\(i,j, k) = 1, and therefore 
Ui(i,j,k) = => ui(i',j,k) = 1 where i' ^ i. This means that Player 1 has a 
maxmin (security) value of at least \ , which can be achieved by a uniform mix 
of the two strategies. As the minmax (threat) value has to be greater than the 
maxmin value, any threat with minmax value \ will be optimal. This is exactly 
what can be achieved in the next two cases: 

Case 3: 3j\/i3k : U\ (i, j, k) = 0. Player 2 has a pure strategy, such that Player 
3 can play matching pennies against Player 1. Let k and k' be the strategies of 
Player 3 achieving payoff against i and i' respectively. Then (j, (ifc, \k')) is 
an optimal threat, with minmax value of i. 

Case 4: 3k\/i3j : U\ (i, j, k) = 0. Player 3 has a pure strategy, such that Player 

2 can play matching pennies against Player 1. Let j and j' be the strategies of 
Player 3 achieving payoff against i and i' respectively. Then k) is an 
optimal threat, with minmax value of ^. 

Case 3 and Case 4 can again easily be identified and solved in linear time. 
Case 5: None of the above. The negation of case 1 implies Vi3j, k : ui(i,j, k) = 
0. The negation of cases 2, 3 and 4 implies ui(i,j, k) = ^> Vj', k! : u\(i' ,f, k) = 
Ui(i',j,k') = 1, where i' ^ i. That is, any strategy profile of Player 2 or 3 can 
achieve payoff against at most one of Player 1 's strategies, and Players 2 and 

3 must agree on which strategy of Player 1 to try to get payoff against; if they 
disagree, the payoff is 1 no matter what Player 1 does. The best they can hope 
for is therefore min Pi(?e [o ; i] max{l — pq, 1 — (1 — p)(l — q)} 7 which gives a lower 
bound on the minmax value of |. This value can be achieved in this case in the 
following way: let u\(i,j,k) = Ui(i',j',k') = 0. Then ((|j, (|fc, \k')) is an 
optimal threat, with minmax value of |. 

2.3 Proof of Theorem 3 

We prove the theorem for three players, the general case is similar. Shapley and 
Snow [14] showed that every k x n zero-sum game has a minmax mixed strategy 
for Player 2 of support at most fc, i.e., using at most k strategies. We claim that 
from this it follows that in every kxnxn game there are strategies for Player 2 
and Player 3 of support at most k so that the resulting strategy profile <r_i yields 
the minmax value for Player 1 when Player 1 chooses a best response. Indeed, 
consider the actual minmax strategy profile = (cr^ 17 ^)- If we consider 0-3 
fixed and consider the resulting two-player game between Player 1 and Player 
2, it is clear that is a minmax strategy of this game and that Player 2 will 
still guarantee the minmax payoff by playing the minmax strategy of support 
k which is guaranteed to exist by Shapley and Snow's result. Similarly, we may 
replace CT3 with a strategy of support k without changing the payoff resulting 
when Player 1 plays a best response. 



Our algorithm is a support enumeration algorithm which exhaustively exam- 
ines each possible support of size k for Player 2 and Player 3. From the above 
observation it follows that the minmax value of the game is the minimum of the 
minmax value of each of the resulting k x k x k subgames. Therefore, we only 
have to explain how to compare the minmax value of such a subgame to a given 
a, and we will be done. For this, we appeal to classical results on the first order 
theory of the reals. 

The decision procedure for the first order theory of the reals due to Renegar 
[11,12,13] can decide a sentence with uj— 1 quantifier alternations, the kth block 
of variables being of size n^, containing m atomic predicates and involving only 
polynomials of degree at most d with integer coefficients of maximum bit length 
L using J L(log J L)2°( 1 °s* i )(md) 2 ° ( " ) ^ ,lk bit operations 2 and {md)°^ n ^ eval- 
uations of the Boolean formula of the atomic predicates. We claim that from this 
it follows that given a k x k x k game G with rational payoffs and a rational 
number a so that (G, a) has combined bit complexity L, we can decide in time 
L(\og L)2°( los L ) k°^ (on a Turing machine) if the minmax value of G for Player 
1 is at most a. We can assume that the payoffs and a are integers at the expense 
of increasing the bitlcngth of every number to at most the combined bitlength 
of the original problem. Define the following polynomials in 2k variables. 

k 2k 

pi(x l7 ...,x 2 k)=^2 Y u 1 (l,i,j)x i x j , r i (x 1 ,...,X2k)=x i 

i=l j=k+l 
k 2k 

q 1 (x 1 ,...,x 2 k) = '^2x i , q 2 (x-L,...,x 2 k) = Y Xl 

i=l i=fe+l 

The sentence we must decide is then 

(3x e R 2fc )[pi(x) < a A • • • Ap fc (x) < a A q\{x) = 1 A q 2 (x) = 1 

Ari(x) > OA ••• Ar 2fc (x) > 0]. 

For this sentence we have w = 1, m = 3fc+2, d = 2 and n\ = 2fc, and the sentence 
can thus be decided in the claimed running time using Renegar's procedure. For 
the support enumeration algorithm this decision procedure must be invoked for 
(^) different kx kx k subgames, and the claimed time bound of the statement 
of the theorem follows. 

Next, we show how to use this algorithm to show W[P] and W[l] membership 
of the two versions of the problem. We use the framework of aftp-programs of 
Chen, Flum and Grohe [5] and Buss and Islam [2] to do this (details of this 
model and its relationship to parameterized complexity are given in Appendix 
A) . To transform the algorithm into an aftp-program showing that the decision 
problem is in the class VF[P] , we simply replace the enumeration by an existential 
steps guessing the sets of indices of size k giving the support of the strategies 



2 The bound stated here is the improvement of the bound stated by Renegar due to 
the recent breakthrough in integer multiplication due to Fiircr [9] . 



of Player 2 and Player 3. In the remainder of this section we will show that for 
the special case 0-1 payoffs the decision problem is in the class W[l]. The idea is 
to precomputc, for every possible k x k x k game with 0-1 payoffs, whether the 
minmax value for Player 1 is at most a. As in the W[P] case, indices of support 
of strategies is computed but now the k x k x k subgame is used as an index 
in the precomputed table. To see that this can be turned into an appropriate 
aftp-program, we will formally define the relations used. 

Assume that the payoffs of Player 1 are given as a fc-tuple ofmxnO-1 matrices 
(U 1 , . . . , U k ). Define a unary relation A over fc-tuplcs of k x k 0-1 matrices as 
follows. (M 1 , . . . , M k ) € A if and only if the minmax value for Player 1 in the 
k x k x k subgame given by (M 1 , . . . , M k ) is at most a. 

Define a 4-ary relation B having as first argument a fc-tuple of k x k 0-1 
matrices and with the last 3 arguments being indices from 1 to k as follows. 

((M\ . . . , M k ), 1, € B if and only if M-j = U\ 3 . 

The algorithm first computes the relations A and B. In the guessing steps the 
algorithm guesses a fc-tuple of matrices (M 1 , . . . , M k ) and indices i\, . . . , ik and 
ji, . . . ,jk- The final checks the algorithm must perform are (M 1 , . . . , M k ) e A 
and ((M 1 , . . . , M k ),l,i a , j b ) <E B for all l,a,be {1, ...,k}. The number of steps 
used for the guessing the indices and the final check is a function depending only 
on the parameter k as required. 

As discussed by Buss and Islam [2] we can in a generic way transform an 
algorithm utilizing a constant number of relations such that it only utilizes a 
single binary relation, thereby obtaining an aftp-algorithm showing that the 
decision problem is in W[V\. 



2.4 Proof of Theorem 4 



Before starting the proof, we remark that the reduction is based on similar ideas 
as the reduction proving NP-hardness of the problem by Borgs et al. However, 
they reduce from 3-coloring rather than clique, and in their coloring based games, 
we don't see how to restrict the strategy space of Player 1 to a small number of 
strategies, so as to obtain fixed-parameter intractability. 

We now describe the reduction. Given an undirected graph G = (V, E), with 
|V| = n. We construct a 2k x kn x kn game from G and compute the threat 
value to within l/(4fc 2 ). We construct the game in the following way. Let A\ = 
{1, . . . , k} x {2, 3} be the strategy space of Player 1 and A^ = A^ = {1, . . . , k} xV 
be the strategy space of Player 2 and Player 3. We define the payoff of Player 1 
as: 



(x 2 ,v 2 ), (x 3 ,v 3 )) 



1 if xi = Xi 

1 if X2 = X3 and v 2 7^ v 3 

1 if X2 7^ X3 and v 2 — v 3 

1 if v 2 7^ v 3 and (^2,^3) ^ E 

otherwise 



Player 2 and Player 3 will try to minimize the payoff of Player 1, hence we shall 
call them bullies. One can think of the game as the bullies each choosing a label 
and a vertex of G. Player 1 then chooses one of the bullies and tries to guess his 
label. If he guesses correctly he will get a payoff of 1. If not, he will get a payoff 
of 0, unless the bullies do one of the following: 



(i) Choose the same label, but different vertices. 

(ii) Choose different labels, but the same vertex. 

(iii) Choose a pair of distinct vertices that does not correspond to an edge. 

In which case he will get a payoff of 1 . The intuition behind the proof is that the 
bullies will be able to avoid these three cases if the graph contains a fc-clique, 
thereby better punishing Player 1. 

We first notice that if G contains a k-clique, the bullies can bring down 
the payoff of Player 1 to 1/fc by choosing a vertex from the k-clique uniformly 
at random and agreeing on a labeling of the vertices. Let p max be the highest 
probability that any of the bullies choose any label j € {1, . . . , fc}. Player 1 will, 
on the other hand, always be able to get a payoff of p max by choosing j and the 
corresponding player. It follows that the threat value of the game is 1/fc when 
G contains a k-clique. 

Next, we will show that if G contains no k-clique, the bullies will at most 
be able to bring down the payoff of Player 1 to 1/fc + l/(4fc 2 ). We will do a 
proof by contradiction, so lets assume that the bullies can force Player 1 to get 
a payoff less than 1/fc + l/(4fc 2 ) and show that this can never be the case. We 
have already seen that in this case p mayL < 1/fc + l/(4fc 2 ). 

Consider the case where Player 1 always chooses Player 2 and guesses a label 
uniformly at random. In this case Player 1 will always guess the correct label 
with probability 1/fc independently of the actions of the bullies. Hence, we need 
to show that for p being the probability of (i), (ii) or (iii) happening we have 
that: 



In particular the probability of (i) happening is less than l/(4fc(fc — 1)). Let 
Pmin > 1 — (fc — l)pmax > 3/ (4fc) + l/(4fc 2 ) be the minimum probability assigned 
to any label by either of the bullies. Let (x, v)i denote that Player i chooses the 
label x and the vertex v. We will use • as a wildcard, such that (x, -)i means that 
Player i chooses the label x and (•, v)i means that Player i chooses the vertex v. 




For v, w e V we see that: 
1 



> Pr[x 2 = X3 and v 2 ^ v 3 ] 



Ak{k- 1) 

= E E Pr [('^)2 I (J, O2] Pr[(j, -) 2 ] Pr [(-, w) 3 I (j, -)s] Pr[(j. Os] 

>fminEE Pr [('^) 2 I (J.O2] Pr[M3 I (.V) 3 ] 

J — 1 

= i£i»£ (l-E Pr [('^) 2 I 0V) 2 ] Pr[(-,«) 3 I Cj.-)s]) 
j=i \ vev / 

fe 

= fcPmin-P 1 2 „i„EE Pr [(^)2 I 0V) 2 ] Pr[(-,1>) 3 I 0V) 3 ]. 

This implies that V/ e {1, . . . , k} : 

^Pr[(, W ) 2 |(/,.) 2 ] Pr[(, W ) 3 |(/,-) 3 ] 

> fc - 4 fcffc-iv - E E Pr [°' w)a 1 (i ' ^ Pr [°' w)3 1 (j ' ^ 



> 1 - Tm > 1 - 



1 



4fc(fc-lR in 4fc(^l)(i + ^) 2 

> i for k > 5 



(fc-l)(3fc+l) 2 2' 



Let v\ be the vertex chosen with highest probability by player i given that he 
chooses label j. The above implies that for k > 5 we have for all j € {1, . . . , k} 
that «2 = v% , and we will therefore simply refer to this vertex as . We also see 
that for k > 5: 



V* e {2, 3}, Vj e {1, ...,*} : Pr[(-, ^ | (j, •),] > ^ 

That is, for every label j the bullies agree on some vertex that they choose 
with high probability when choosing j. We will use this to get a contradiction 
from case (ii) or (iii). For j, I e {1, . . . , k}, it will cither be the case that there 
exists some = v l , with j ^ I, or that all the v^'s are distinct. In the first case 
Player 1 will, with high probability, get a payoff of 1 when one of the bullies 
chooses label j and the other chooses label I (case (ii)). In the second case there 
will exist a pair of distinct labels j and I, such that there is no edge between v j 
and v , since the graph doesn't contain a k-clique. Hence, this will cause Player 
1 to get a payoff of 1, with high probability, when one of the bullies chooses label 
j and the other chooses label I (case (iii)). In both cases we get that for k > 5: 



£)Pr[(.y) 2+j I 0',-)2+i] Pr[(j,-)2+i] Pr[(-,«') 3 -i I Os-i] Pr[(i,-)3- 

i=0 

2 1 / q i \ 2 / QI , , ^2 



IV 1/3 1 \ (3fc + l) 2 1 



2 \4fc 4fcV 32fc 4 4fc(fc- 1) 

Which contradicts that this should happen with probability less than l/(4fc(fc— 
1)), meaning that if G contains no k-cliquc the bullies will at most be able to 
bring down the payoff of Player 1 to 1/fc + l/(4fc 2 ), which completes the reduc- 
tion. 



3 Conclusions and open problems 

As mentioned above, an important open problem is achieving a non-trivial ap- 
proximation of the minmax value of an n x n x n game in polynomial, rather than 
quasi-polynomial time. Another interesting question comes from the following 
notions: The threat point of a game is defined to be the vector of minmax values 
for each of its three players. We may consider approximating the threat point of 
a game where one of the players has few strategies. For this, we have to consider 
the problem of approximating the threat value for Player 1 in a three-player 
n x k x n game. That is, it is now one of the two "bullies" rather than the 
threatened player that has few strategies. We observe that for constant e > 
such an approximation can be done efficiently by simply discretizing the mixed 
strategy space of the player with few strategies using a lattice with all simplex 
points having distance at most e to some lattice point, and then for each lattice 
point solving the game for the remaining two players using linear programming. 
Combining this with Theorem 3 gives us the following corollary 

Corollary 1. There is an algorithm that, given an n x k x n game with 0-1 
payoffs and an e > 0, computes the threat point within additive error e in time 
(n/e)° (k l 

The discretization technique gives algorithms with poor dependence on the de- 
sired additive approximation e. We leave as an open problem if the minmax value 
of an n x k x n 0-1 game can be approximated within e in time (nlog(l/e))°( fc ). 
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Appendix A: Parameterized complexity 

For the convenience of the reader, we summarize notions from the theory of 
parameterized complexity used in this paper. 

Definition 1. A parameterized problem is a set of pairs Q C S* x S* , where 
S is a finite alphabet. The second coordinate is the parameter. 

We may think of Q as being a subset of S* x N, and we will do so henceforth. 

Definition 2. A parameterized problem Q C £*xN is fixed parameter tractable, 
if there is a computable function f : N — ► N, a polynomial p and an algorithm 
that given (x, k) C S* x N decides if (x, k) £ Q in at most f(k)p(\x\) steps. 

Let FPT be the class of parameterized problems that are fixed parameter tractable. 
To compare the complexity of problems that are not fixed parameter tractable 
the notion of fixed parameter reductions are introduced. 

Definition 3. An fpt-reduction from a parameterized problem Q C E* x N to 
a parameterized problem Q' C (£')* x N is a function R : S* x N — > (X")* x N 
such that 

1. For all (x, k) £ S* x N we have (x, k) £ Q if and only if R(x, k) £ Q' . 

2. There exists a computable function g : N — > N such that for all (x, k) £ 
S* x N we have k' < g{k), where R(x, k) = (x 1 ' , k'). 

3. There exists a computable function f : N — > N and a polynomial p such that 
R can be computed in at most f(k)p(\x\) steps. 

A hierarchy of parameterized complexity classes (the W hierarchy) 

FPT C W[l] C W[2] C • • • C W[SAT] C W[P] 

was defined by Downey and Fellows [7]. The classes W[t] were defined by the 
class of problems fpt-reduciblc to the weighted satisfiability problem for con- 
stant depth, so-called weft-t circuits. The class VF[SAT] is the class of problems 
reducible to the weighted satisfiability problem for Boolean formulas, and finally 
W[P] is the class of problems reducible to the weighted satisfiability problem for 
Boolean circuits. 

Chen, Flum and Grohc [5] gave a machine based characterization of all classes 
in the W hierarchy, except VF[SAT], that we will use instead. The remainder of 
this section is a description and statement of this characterization. 

A WRAM is an alternating random access machine. Its registers are parti- 
tioned into two sets: standard registers ro, ri, . . . , and guess registers go,gi, . . ■ ■ 
The guess registers will be indirectly indexed by the standard registers. We write 
g(ri) for g Ti . For accessing the standard registers the machine has all the instruc- 
tions of a standard deterministic random access machine. For accessing the guess 
registers the machine has the following instructions. 



Instruction 


Semantics 


EXISTS t j 
FORALL t j 
JGEQUAL i j c 
JGZERO i j c 


existentially guess a natural number < ro and store it in g{rj). 
universally guess a natural number < r and store it in g(rj). 
if g(ri) = g(tj) jump to the instruction with label c. 
if r (g{r i ),g(r i )) = jump to the instruction with label c. 



Here (•, •) is any simple pairing function of natural numbers with < (1 + 
max(i,j)) 2 and (0,0) = 0. 

A program P for a WRAM is an afpt-program, if there exists a computable 
function / and a polynomial p such that for every input (x, k) and every possible 
run, the program 

— performs at most f(k)p(\x\) steps. 

— performs at most f(k) existential and universal steps. 

— accesses only the first f(k)p(\x\) standard registers. 

— only stores numbers < f{k)p{\x\) in any register. 

With these definitions we have the following characterization of the classes W[t] 
and ^[F]. 

Theorem 6. A parameterized problem Q is in W[P] if and only if there is an 
aftp-program without universal instructions deciding Q. This also holds if we 
allow the aftp-program direct access to the guess registers. 

Theorem 7. Let Q be a parameterized problem and t > 1. Then Q is in W[t] 
if and only if there is a computable function h, a natural number u and an aftp- 
program deciding Q such that every run of the program on any given input (x, k) 
satisfies. 

— All nondeterministic steps are among the last h(k) steps of the computation. 

— The first nondeterministic step is existential. 

— There are at most t—1 alternations between existential and universal states. 

— Any sequence of steps without alternations, except the first existential se- 
quence, contains at most u nondeterministic steps. 



